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CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application derives priority from Provisional Patent Application No. 
60/153,569 filed on 13 September 1999. 

BACKGROUND OF THE INVENTION 

1 . Field of the invention 

The present invention relates to Boolean set operations on data sets and, more 
particularly, to an improved method and software process for Boolean set intersection and set 
union operations among polygon-represented data sets using a digital computer, 

2, Description of the Background 

Boolean set operation generation is a foundational computational capability in a wide 
range of diverse problem domains, including image processing, spatial data analysis, 
constraint-based reasoning, earth resource evaluation, crop management, market analysis studies, 
micro-fabrication, mining, weather forecasting, military planning, and utility management. For 
example, rain forest shrinkage over time can be studied by performing Boolean set intersections 
between processed earth resource imagery and historical vector-represented geo-spatial map 
products depicting vegetation. The determination of regions "within 100 miles of Oklahoma 
City possessing a slope less than 3 degrees where wheat is grown" can be determined by 
performing Boolean set intersections between a circular region with radius 100 miles about 
Oklahoma City, a map product depicting slopes, and a land use product depicting agricultural 
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crops. Ground-based target locations for military applications can often be significantly refined 
by intersecting sensor generated error ellipses with domain features that favor the presence of 
such vehicles (roads, high mobility regions, and regions that afford cover and concealment), 
while de-emphasizing regions that do not favor such vehicles (swamps and waterways). Each of 
these examples relies on Boolean set operation computation among potentially large 2-D 
spatially-organized data sets. All Geographic Information Systems (GIS) and modern database 
management systems (DBMS) provide direct support to two-dimensional Boolean set operation 
generation. 

For polygon-represented data sets, the key Boolean set operations of intersection and 
union have been typically generated using algorithms from a field of mathematics known as 
computational geometry. Due to the highly combinatorial nature of these methods, 
computational requirements tend to increase rapidly as a function of the "size" of the data sets 
(i.e., the number of vertices in the polygon representations). Further dramatic increases in 
computational requirements occur when the polygons are multiply-connected (e.g., polygons 
with included holes or regions that consist of a collection of non-intersecting smaller regions). 
As a direct result of the inherent combinatorial computational requirements, large-scale dynamic 
applications have tended to be impractical Even highly parallel implementations, as well as 
significant improvements in hardware performance cannot effectively overcome the staggering 
computational requirements of some prospective applications. 

Traditionally, two-dimensional objects could be represented by one of two generally 
accepted spatial data structures: either the region quadtree or tuple-based polygon 
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representations- The region quadtree spatial data structure provides a regular, recursive 
decomposition of 2-D space. Quadtrees are used in the construction of some multidimensional 
databases (e.g., cartography, computer graphics, and image processing). A quadtree is a means 
of encoding an image as a tree structure. Each node of the tree has up to four children. The root 
node represents the entire image; its children represent the four quadrants of the entire image; 
their children represent the sixteen subquadrants, and so on. Because the region quadtree is an 
areal-based representation, it supports highly efficient top-down spatial search and data 
manipulation, as well as areal-based set operation generation. 

Despite these benefits, the region quadtree representation exhibits significant 
shortcomings. Because the "size" of the quadtree representation increases by a factor of four for 
each level of decomposition, excessively large storage requirements (and concomitant 
computational requirements) can occur for applications that demand high fidelity data 
representation. Tuple-based representations, on the other hand, provide memory-efficient, high 
fidelity representations of point, line, and region features. However, because such 
representations do not directly support spatial search and spatial data manipulation, data analysis 
tends to lead to highly combinatorial computational requirements. 

Thus, both tuple-based and region quadtree-based spatial data representations possess 
strengths, as well as obvious weaknesses. Spatial search, reasoning, and set generation that tends 
to be straightforward and efficient for quadtree-based representations, but lead to 
computationally-intensive algorithms for tuple-based representations. Data storage requirements 
to support high fidelity data sets are readily met with tuple-base representations, while direct use 
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of region quadtree representations leads to excessively large storage and associated data 
manipulation requirements. 

There are few prior efforts at more efficient set operation generation procedures. One 
example, U.S. Patent No. 5649084, discloses a method for performing boolean operations on 
geometric objects in a computer-aided design system. Edges of a first object are intersected with 
surfaces of a second object to produce intersection points, and surfaces containing the faces of 
the two objects, respectively, are intersected with each other to produce intersection tracks. If 
there is an inconsistency between the intersection points and corresponding intersection tracks, a 
perturbation step is applied to correct the spatial positions of inconsistent intersection points. 
This maintains geometric consistency and avoids overly complex set operations. However, it 
also changes the original objects and sacrifices accuracy. 

Rather than seeking incremental enhancements over traditional approaches, in the early 
1980s the present inventor pioneered an entirely new data representation structure referred to as 
the region quadtree-indexed vector representation. See, Antony, R. and Emmerman, P., Spatial 
Reasoning and Knowledge Representation, Geographic Information Systems in the Government 
Workshop Proceedings, A. Deepak Publishing (1 986). This structure helps to dramatically 
reduce the computational requirements for set operation generation among both simple and 
multiply-connected polygon-represented data sets. The quadtree-indexed vector representation 
is, in effect, a hybrid data structure that combines the benefits of a hierarchical, multiple 
resolution grid-based spatial representation (the region quadtree data structure) with the accuracy 
and storage efficiency of tuple-based (polygon boundary) representations. In the 
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quadtree-indexed vector representation, the region quadtree serves not only as a spatial indexing 
mechanism into memory-efficient, high fidelity representations of points, lines, and region 
boundaries, but as a multiple resolution spatial representation in its own right. 

FIG, 1 illustrates the relationship between an indexing cell(s) and the underlying 
tuple-based data. Note that for point and line (commonly referred to as polyline) features, every 
quadtree cell in the representation "indexes" tuple-based data. For regions, however, there exist 
two classes of quadtree indexing cells: those that index interior cells and those that index 
boundary cells. Interior cells are quadtree cells that are fully included within a region; boundary 
cells are partly within and partly outside a region. Thus, for simply-connected regions, only 
quadtree boundary cells associated with the region's boundary directly index tuple-based data. 

For polygons that possess one or more holes as shown in FIG. 2, tuple lists define not 
only the outer boundary of the polygon, but also the boundaries of all included holes. The tuple 
list of a hole serves both as the description of the edge of the hole, as well as a description of an 
inner edge of the region. The "direction" associated with the tuple list of a first-order hole is 
opposite that of the outer boundary list convention. For embedded holes, each subsequent tuple 
list is order-reversed to maintain logical consistency with respect to the "inside" and "outside" of 
the region. 

FIG. 3 illustrates tuple-represented piece-wise continuous line feature representing a 
region boundary along with the associated quadtree-indexing cells. This figure shows the 
association between line segment vertex tuples and the quadtree-indexing cell (actual data points 
are shown as filled circles; pseudo points are shown as open circles). The outer square in the 
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figure represents an indexing cell at some arbitrary level of decomposition. Data points 2, 3, 4, 
5, and 10 are seen to lie within the cell. The quadtree-indexed vector spatial data structure 
requires the addition of pseudo points at all cell entrance and exit boundaries. Referring back to 
FIG 3, four pseudo points are associated with this particular cell: points 1, 6, 9, and 11. If 
represented at the next higher resolution quadtree grid size (indicated by the four smaller squares 
in Fig. 3), additional pseudo points are required at the boundaries of the smaller indexing cells. 
For example, the upper right indexing cell consists of the entrance tuple pseudo point 3', the data 
tuples 4 and 5, and the exit tuple pseudo point 6. Because one or more data tuples could fall on 
a cell boundary (a data tuple actually lies at the cell boundary or a line segment follows the cell 
boundary), the implementation must distinguish pseudo points from the original tuple list. 
Adjacent cells maintain duplicate shared cell boundary tuples. The lower right and the upper 
right cells, for example, share tuple 3'. 

In addition to distinguishing between data point tuples and indexing cell boundary 
pseudo points, the implementation must distinguish between interior (non-boundary) indexing 
cells and boundary indexing cells, 

FIGs. 4(a)-(c) shows the interaction between two regions, and FIGs, 4(c)-(d) shows the 
interaction between the region's respective quadtree grids. Unless two regions share at least one 
common quadtree indexing cell, no direct interaction between the two data sets occur. When no 
common cells occur, set intersection is null and set union is the set formed by all the cells in the 
two disjoint regions. However, when common cells do occur, the set intersection is as seen in 
Fig. 4(0. 
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FIG. 5 illustrates a categorical approach for analyzing set operations that was developed 
by the inventor herein and is described in Antony, R,, Principles of Data Fusion Aittomation 
(Artech House, 1995). When set intersection exists, set operation generation is treated as a 
three-stage (sub-problem) process involving the following canonical form classes. 
Class 1 : interactions between two interior cells, 
Class 2: interactions between a boundary and an interior cell, and 
Class 3: interactions between two boundary cells. 
The first two stages entail only relatively trivial computations, and the appropriate 
methods for generating the products from both Class 1 and Class 2 are described in the^/;/o/?y 
treatise, supra at 95. The third stage, however, is considerably more involved and effectively 
controls the computational complexity of the set operation generation process. Because both 
polylines and points can be treated as degenerate regions (regions with no "interior cells"), set 
union and intersection operations for these lower order features can be treated as a special case 
of the region generation methodology. 

Consequently, it would be greatly advantageous to provide an optimal method for 
generating the product of the third, and key stage of the procedure: class 3 interactions between 
two boundary cells. 
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SUMMARY OF THE INVENTION 

It is, therefore, an object of the present invention to provide an improved method for 
generating Boolean set intersection and set union among polygon-represented data sets using a 
digital computer. 

It is another object to provide a method as described above which is computationally 
superior to current computational geometry methods, giving orders of magnitude improvement 
in efficiency over current computational methods. 

It is another object to provide a method of Boolean set operations that fully capitalizes on 
the efficiencies gained by quadtree-indexed vector representation of "tuple-based lists" that 
define region(s) associated with the union and intersection of the original data sets. 

According to the present invention, the above-described and other objects are 
accomplished by providing a method and software implementation for computing Boolean set 
operations on two regions defined by a quadtree-indexed vector representation of data point 
tuples. The method steps initially include establishing indexing cells about the two regions. 

Next, the method distinguishes the three canonical form classes: Class 1 interactions 
between two interior cells ("Interior x Interior" indexing cells) containing overlapping interior 
cells of both regions; Class 2 interactions between a boundary and an interior cell ("Interior x 
Boundary" indexing cells) containing interior cells of one region and a portion of a boundary of 
the other regions; and Class 3 interactions between two boundary cells ("Boundary x Boundary") 
indexing cells which contain a portion of the boundaries of both of the regions. As described 
above, the first two stages entail only relatively trivial computations, and the appropriate 
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methods for generating the products from both Class 1 and Class 2 are described in the Antony 
treatise, supra at 95. The present method focuses on the third Class 3 which is considerably 
more involved and effectively controls the computational complexity of the set operation 
generation process. 

As a next step, pseudo points are defined for each Boundary x Boundary indexing cell. 
Four separate pseudo points are defined for each Boundary x Boundary indexing cell, one where 
each of the two regions enter and exit the boundary indexing cell. 

Next, each Boundary x Boundary indexing cell is categorized based on a relationship of 
its defined pseudo points, and one of the two regions is selected to be a starting region for that 
indexing cell based on the categorization. 

Given the category and starting region, the set operation is performed by including a list 
of tuples in each Boundary x Boundary indexing cell that are encountered while following 
("tracing") the boundary of the starting region until an intersection of the boundaries of said two 
regions occurs, then accumulating tuples associated with the other of region, and so on until the 
indexing cell is traversed. 

Given the tuples included from the set operations on all Boundary x Boundary indexing 
cells, it is a simple task to combine the result with the set operations on Interior x Interior 
indexing cells and Interior x Boundary indexing cells in a known manner to produce a complete 
union or intersection. 
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The foregoing method fully capitalizes on the efficiencies gained by quadtree-indexed 
vector representation, and as shown below is capable of giving orders of magnitude 
improvement in efficiency over current computational methods. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Other objects, features, and advantages of the present invention will become more 
apparent from the following detailed description of the preferred embodiment and certain 
modifications thereof when taken together with the accompanying drawings in which: 

FIG. 1 is a graphical illustration of interior and boundary quadtree cells associated with a 
polygon-represented region. 

FIG. 2 is a graphical example of a multiply-connected polygon having a hole. 

FIG. 3 illustrates a two-level decomposition of a quadtree-indexed vector spatial 
representation. 

FIG. 4(a)-(f) illustrates an example of the quadtree-based intersection of two regions. 

FIG. 5 is an exploded view of the set operation interaction shown in Fig, 4. 

FIG. 6 is a flow chart implementation reflecting the software steps for implementing the 
software method of the present invention. 

FIG. 7 is a graphical illustration of the intersection of a boundary cell and an interior cell 
(upper row a-c) and the intersection of two boundary cells (lower row d-f). The interior of all 
regions are shown as shaded. 
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FIG. 8 is a graphical illustration of the various kinds of bounding box relationships 
between line segments for two features. 

FIG, 9 is a graphical example of two lineal features within a quadtree indexing cell 
showing each tuple-pair bounding box. 

FIG. 10 is a graphical illustration of the four possible interactions between two line 
segments whose bounding boxes intersect 

FIG. 11 is a graphical illustration of two region boundary features in two adjacent 
boundary cells. 

FIG, 12 is a graphical illustration of two region boundary features that both reenter a 

cell. 

FIG. 13 is a graphical illustration showing a feature boundary reentry. 

FIG. 14 is a catalog of twenty-four possible boundary cell interaction cases, 

FIG. 15 is a table that summarizes the prescribed manner for following (or "tracing") the 
features from a designated entry or exit point and onward to the next polyline intersection point. 

FIG. 16 is a graphical illustration of a grid of nine quadtree-indexing cells associated 
with two polygon-represented regions. 

FIGs, 17 and 18 provide expanded views of the center cell of Fig. 16. 

FlGs, 19-22 provide graphical illustrations of representative examples that demonstrate 
the application of FIG. 15 for Categories III - VI. 

FIG. 23 is a graphical illustration of a representative example Category I interaction. 

FIG. 24 is a graphical illustration of a representative example Category 11 interaction. 
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FIGs. 25-28 provide graphical examples that illustrate the analysis of regions with 
included holes, 

FIG, 29 is a table that summarizes general characteristics of each canonical form class. 
FIG. 30 is a graphical illustration of two regions exhibiting (a) limited boundary cell 
overlap and (b) two regions exhibiting extensive edge overlap. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMEl^TS 

An entirely new and improved method and software implementation is disclosed for 
generating Boolean set intersection and set union among two regions defined by a 
quadtree-indexed vector representation of data point tuples using a digital computer. Rather than 
relying on traditional cell-by-cell ANDing operations for identifying indexing cells that interact, 
the present method treats index cell interaction as a spatial search problem. 

Two of the steps are known {Antony treatise, supra at 95. ), and these include the steps 
of: 1) establishing indexing cells about the two subject regions; and 2) distinguishing three 
canonical form classes of indexing cells: Class 1 interactions between two interior cells 
("Interior x Interior" indexing cells) containing overlapping interior cells of both regions; Class 
2 interactions between a boundary and an interior cell ("Interior x Boundary" indexing cells) 
containing interior cells of one region and a portion of a boundary of the other regions; and 
Class 3 interactions between two boundary cells ("Boundary x Boundary") indexing cells which 
contain a portion of the boundaries of both of the regions. The remaining novel steps, when 
performed in combination, speed set operation processing considerably. These steps are 
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generally as follows: 3) defining pseudo points for each Class 3 Boundary x Boundary indexing 
cell; 4) categorizing each Boundary x Boundary indexing cell based on a relationship of its 
defined pseudo points, and identifying one of the two regions to be a starting region for that 
indexing cell based on the categorization; and 5) given the category and starting region, 
performing the set operation by including all tuples in each Boundary x Boundary indexing cell 
that are encountered while following ("tracing") the boundary of the starting region until an 
intersection of the boundaries of the two regions occur, then accumulating tuples associated with 
the other region, and switching regions at each intersection until the indexing cell is traversed, 

FIG. 6 is a flow chart implementation reflecting the software steps for implementing the 
above-described method. 

At step 10, the method initializes the bounding box and then establishes a grid of 
indexing cells about two subject regions. Initialization of the bounding box is simply 
accomplished by establishing the problem space about two subject regions, and then 
decomposing the problem space into the grid array of indexing cells. At the first level of 
decomposition, a selected problem space is sub-divided into four equal-sized quadrants. Each of 
these quadrants is then recursively decomposed into four equal size quadrants. Preferably, the 
number of levels of decomposition can be user-selected. At each level of decomposition, the 
quadtree provides a mutually exclusive and exhaustive representation of a selected region of the 
problem space. The net result is a variable (user-sized) grid of indexing cells about the two 
subject regions. 
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At step 20, pseudo-points are defined for the borders of the two regions. For each 
indexing cell of the grid that contains part of the boundary of at least one region (a Boundary 
indexing cell), pseudo-points are assigned at the position of entry and exit of each of each 
region's border from that indexing cell. 

At step 30, the desired type of set operation is retrieved (by user-input or pre- 
programming), and this may either be Quad intersection or union. 

At step 40, the vector operation is performed. The software implementation of this step 
preferably involves two precautionary pre-tests to avoid undue processing. A first pre-test is run 
to ensure that the operation is not being performed on the same region (as opposed to two 
different regions). If performed on the same region, the solution for both indexing cell (Quad) 
intersection or union is simply the list of tuples associated with the singular region. The second 
pre-test is performed to ensure that the solution is not a null solution (in other words, that the 
two regions do interact by some degree). 

Assuming some interaction, the operation continues to the novel processing steps in 
accordance with the present invention. Interactions between indexing cells are classified as 
follows: interactions between two Interior cells ("Interior x Interior" indexing cells), interactions 
between a Boundary and an Interior cell ("Interior x Boundary" indexing cells), and interactions 
between two Boundary cells ("Boundary x Boundary") indexing cells. These three classes are 
treated separately as follows: 



Class 1 : Interior x Interior cells 
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All matching interior cells within the two regions are added to the intersection product. 
For set union, all (non-duplicate) interior cells of the two regions are included in the product. 
Duplicate cells can be readily identified as those interior cells that are within the set intersection 
product (to be within the intersection product, both regions must necessarily contain identical 
cells). If the cells in the two data sets possess different indexing cell sizes (i .e., different levels of 
quadtree decomposition), the product for set intersection is the smaller of the two cells, while the 
product for set union is the larger of the two cells. Consequently, both class-1 intersection and 
union generation operations inherently handle disparate size quadtree cells. 

Class 2: Boundary x Interior cells 

As illustrated in the first row of Fig. 7 (a-c), the set intersection of a boundary cell (a) 
and an interior cell (b) yields the boundary cell (c), unless the boundary cell is physically larger 
(coarser decomposition level) than the interior cell. In this case, the intersection product is the 
boundary representation that occurs within the smaller interior quadtree cell. For set union, the 
product is the interior cell (b), independent of cell size. 

Class 3: Boundary x Boundary cells 

Whereas Class-1 and Class-2 set operation products are straightforward to compute, 
Class-3 products (second row in Fig. 7) represent a considerably more challenging problem. The 
method of the instant invention includes an optimal procedure for generating Class-3 Boolean set 
operation products. The optimal procedure stems from the realization that for set operation 
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generation, there exist six categorical Class-3 product forms. These forms are topologically 
complete in the sense that they characterize all possible region boundary interactions between 
two data sets. Moreover, given the canonical form category of a particular Boundary cell x 
Boundary cell, the set operation procedure for that cell can be determined based solely on the 
ordering of the entrance and exit pseudo-points along the boundary of a quadtree-indexing cell. 
This characterization leads to a well-behaved procedure capable of efficiently constructing both 
the intersection and union product for a given indexing cell once (1) the canonical form category 
is established and (2) all polyline feature intersection points within the cell are computed. As 
proven below, the procedure is potentially order of magnitudes faster that traditional 
computational geometry approaches. Furthermore, since each Boundary cell x Boundary cell 
operation is fully independent, Class-3 operations can be performed in parallel, further 
enhancing set operation generation performance. 

Specifically, the Class-3 operation begins by employing simple algebraic techniques to 
compute all intersection points between the linear boundaries of the two regions within an 
indexing cell. Once computed, these intersection points are added at their appropriate location 
within each feature's ordered tuple lists. 

FIG. 8 is a graphical illustration of the various kinds of bounding box relationships 
between line segments for two features. 

FIG. 9 is a graphical example of two lineal features within a quadtree indexing cell 
showing each tuple-pair bounding box. Arrowheads indicate line segment direction. 
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With collective reference to FIGs. 8 and 9, note that the intersection points must be 
tagged to distinguish them from both data tuples and pseudo points. This is necessary because 
hounding hoxes associated with individual tuple pairs from two (or more) regions might intersect 
while the line segments (represented by a tuple pair) might not intersect. In the example shown 
in Fig. 8, the bounding boxes for some black and gray segments overlap while others do not (For 
convenience, the terms "black" and "gray" are used throughout this document to distinguish the 
two features that exist within a given indexing cell. However, the designation is totally 
arbitrary). Because line segments whose bounding boxes overlap may not actually intersect, 
bounding box intersection is merely a precursor to line segment intersection determination. 

A simple procedure, such as the approach offered below, can be used to compute all line 
segment intersection points within a boundary cell: 

1 . Test all line-segment bounding boxes associated with tuple pairs for possible 
intersection. 

2. For all bounding box pairs that intersect, determine possible intersection points of 
the line segments by first computing the intersection points of the associated lines (Note: these 
lines extend to infinity): 

y = m X - m Xj + yi (associated with gray segment) 
y' = m' X* - m' x'j + y'j (associated with black segment) 
where m - (yj^, - y^ / (Xj,, - Xj) 
ni'-(yVi-y'i)/(x'H,-x\) 
y = mx-mXi-fyj = mx + K where K= - m Xj + yi 
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y' - m' X - m' x'i + y'i - m' x + K' where K' = - m' x'j + y'j 
Set X = x' and y = y' to locate point of intersection: 
=>(ni-m')x =K'-K 
=> x intersection point = (K' - K) / (m - m') 

To prevent overflow, it is necessary to check to see if m = m' before this computation is 
performed. If m ^ m\ do not attempt to compute the intersection point since the two lines are 
parallel and therefore can not intersect. Once the x value of the intersection point is computed, 
then compute the y intersection point: 

y intersection point ~ m x - m Xj + yj 

3. To determine whether the line segments actually intersect, test whether the computed 
intersection point lies within the line-segment bounding boxes for both features. If the 
intersection point lies within both bounding boxes, the two line segments intersect; otherwise, 
they do not intersect. 

4. All discovered intersection points are then added to both feature tuple lists between 
the appropriate end point tuples. 

FIG 10 illustrates the four possible interactions between two line segments whose 
bounding boxes intersect. In type 1 interaction, no intersection occurs. In type 2, line segments 
touch at a point. In type 3, line segments from the two features are co-linear. In type 4, two 
features cross each other at a point. 
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FIG. 1 1 illustrates two region boundary features in two adjacent boundary cells. In this 
case, the black feature begins in the left-hand cell, moves to the right-hand cell, and then reenters 
the left-hand cell In such case, the intersection product follows the gray (or "inside" feature) 
boundary. 

FIG, 12 illustrates two region boundary features that both reenter a cell. In this case, 
each cell is treated independently, and all secondary (tertiary, etc.) entrance points are handled 
sequentially. In the case of the lefthand cell in Fig. 12, the boundaries marked "Black 1" and 
"Gray 1" could be treated first, the product of "Black 2" and "Gray 2" treated second, and the 
two results combined. Alternatively, the first product could be combined with "Black 2" and the 
resuhant product combined with "Gray 2." The basic analytical process described herein, 
however, remains the same. 

FIG. 13 illustrates a feature boundary reentry and demonstrates that the intersection 
product consists of two distinct closed line-segment "cycles" as outlined by the bold black 
curves. The cycles include both intersection points and pseudo-points. A set operation is 
performed by including a list of tuples encountered while following ("tracing") the cycles until 
complete. However, the tracing of line-segments depends on the orientation of the region 
boundaries, which is in turn reflected by the orderings of entry and exit pseudo-points. 

FIG. 14 catalogs the twenty-four possible boundary cell interaction cases, providing a 
systematic and exhaustive characterization of all possible entrance and exit pseudo-point tuple 
orderings. Because, as already discussed, the starting tuple for the orderings is entirely arbitrary, 
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four cyclical orderings exist that describe the identical entrance and exit point relationships. For 
instance: 

Gray Entrance (Ge), Black Entrance (Be), Black exit (Bx), Gray exit (Gx) 
characterizes the identical cell boundary relationships (and consequently represents the same 
canonical form) as: Black Entrance (Bp), Black exit (Bx), Gray exit (Gx), Gray Entrance (Ge). 
Consequently, there exist only six canonical orderings. 

tn the method of the present invention, the set operation is completed by collecting tuples 
from the two region lists (in a manner prescribed by the canonical ordering of entrance and exit 
point relationships) until the cycle closes on itself (the sequence of assigned tuples returns to its 
starting tuple). 

More specifically, Class-3 union and intersection operations generate two types of tuple 
cycles: boundary closing and internal. Boundary-closing cycles are formed by starting at either 
a black (Be) and/or (depending on the boundary cell x boundary cell category) gray (GpJ feature 
entry tuple (or pseudo point). Boundary cycles are so named because the cycle implicitly closes 
along the cell boundary (or edge). Internal cycles begin at a polyline intersection point and 
follow features in a prescribed manner, until the initial intersection point is reached (or the cycle 
is complete). In both cases, the features are followed from a designated entry or exit point and 
onward to the next polyline intersection point of the two regions. The tracing then switches to 
the other region, accumulating tuples associated with the other region, and so on until the 
indexing cell is traversed. This process of switching features at each successive polyline 
intersection point continues until the cycle is complete. A boundary-closing cycle is complete 
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once the edge of the quadtree cell is reached (in other words, when a pseudo point on the 
indexing cell boundary is hit). Internal cycles are complete when the accumulation process 
reaches the starting tuple for that cycle (without encountering a pseudo point on the indexing cell 
boundary) 

FIG. 1 5 is a table that summarizes the prescribed manner for following (or "tracing") the 
features from a designated entry or exit point and onward to the next polyline intersection point, 
etc., and onward according to the generation methodology of the present invention. Column I 
lists the six unique cyclical orderings of entrance and exit tuples for a given quadtree-indexed 
cell. Column 2 identifies the assigned interaction Category ("interaction class") number (note 
that the categories are logically grouped and are not in strictly numeric order). Columns 3 and 4 
specify the starting points for intersection and union generation, respectively, assuming "inside is 
to the right" Columns 5 and 6 specify the starting points for intersection and union generation, 
respectively, assuming "inside is to the left." 

This recursive cycle generation process continues until all polyline intersection points 
within the cell have been used. As indicated by FIG. 15, for Categories III- VI, both intersection 
and union cycle generation begins with a boundary-closing cycle (cycle begins at a pseudo 
point). However, this is not the case of Categories I and II. We describe the cycle generation 
process for Categories III- VI first and then discuss the necessary modifications required to 
accommodate Categories I and II. 

Category III- VI Intersection/Union Product Generation 
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The principal steps in constnicting Category III - VI (boundary cell) x (boundary cell) 
intersection and union products can be summarized as follows: 

(1) Evaluate the canonical form Category by determining the counter-clockwise ordering 
of the cell boundary entrance and exit points for the two features present in the cell. If one or 
both features reenter the cell, each interaction can be treated separately, or alternatively, the cell 
can be further decomposed to attempt to eliminate or at least reduce the number of feature 
re-entries, and thereby the complexity of the analysis. 

(2) Construct the appropriate boundary-closing cycle(s) by determining from FIG. 1 5 the 
appropriate cell entry point (either gray entrance or black entrance) to begin cycle construction. 
Category III- VI possess only a single boundary closing cycle. 

(3) Because boundary-closing cycles close on the edge of the quadtree-indexing cell, 
processing for such a cycle terminates once a pseudo point on the cell boundary is encountered 
during the cycle traversal operation. The boundary-closing segments of a boundary-closing cycle 
represent implicit boundaries and are not actually reported as part of the set operation product. 

(4) Once the boundary-closing cycle has been constructed (the tuples collected), the next 
"unused" polyline intersection point (if one exists) associated with the "entrance" feature list 
becomes the starting point for an internal cycle. Just as before, succeeding tuples for that feature 
are added to the list (effectively "tracing" the line segments in the region boundary) until the 
next internal polyline intersection point is reached. At that point, the tuple accumulation process 
("feature-following") shifts to the other feature. This cycle-forming process continues until this 
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new cycle is complete (the initial tuple in the cycle list is reached). Additional cycles, if they 
exist, begin at the next unused polyline intersection point. 

(5) This "cycle generation" process is repeated until all the polyline intersection points 
within the cell have been utilized. The generated tuple list(s) becomes the final set operation 
product for that cell 

The Boundary cell x Boundary cell set operation generation process just described is 
further illustrated in the example shown in FIG. 16. 

FIG. 16 depicts a grid of nine quadtree-indexing cells associated with two 
polygon-represented regions. Cell 1 does not index either feature and therefore is not involved 
in any subsequent analysis. Cells 2, 3, 5, 6, 8, and 9 index the boundary of the gray feature; 
cells 4-9 index the boundary of the black feature. Cells 2, 3, 4, and 7 index only a single feature 
and therefore contribute to the union product, but not to the intersection product. Cells 5, 6, 8, 
and 9 contain both gray and black feature tuples and therefore represent Class 3 interactions 
between two boundary cells {Antony treatise, supra at 95). 

To simplify the discussion, we focus our attention on only the center cell (cell 5) in Fig. 

16. 

FIGs. 17 and 18 provide expanded views of the center cell of Fig. 16. Specifically, FIG. 
17 illustrates the intersection product generation process; FIG, 18 illustrates union generation. 
The polyline intersection points (inserted into the working memory tuple lists for both features) 
are shown as solid black circles; the entrance and exit pseudo points are indicated by open 
circles. The counter-clockwise ordering of the entrance and exit tuples (GxBeGeBx) represents a 
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cyclical ordering of Category V (FIG. 15, column 1, row 6). Using the convention that the 
inside of a region is "to the right," column 3 of FIG, 15 indicates that for a Category V 
interaction, intersection generation requires a single boundary-closing cycle (as seen in last row, 
third column). As indicated by this entry, the intersection product begins with the gray entrance 
pseudo point G^. 

The desired set operation product for a cell is a list of all tuples for that part of the region 
intersection that occurs within the cell shown in Fig. 17, Starting with the gray entrance point 
tuple, the gray feature is followed (i.e., all encountered gray tuples in the list are added to the 
intersection product) until the first polyline intersection point occurs (labeled point 2 in the 
figure). At the intersection point, the tuple accumulation process continues by accumulating 
tuples associated with the black feature (point 3 is added to the product) until the next 
intersection point occurs (point 4). Traversal is always in ascending order of the tuple lists. 
Because point 4 is a polyline intersection point, accumulation continues by collecting tuples 
associated with the gray feature (point 5 is added to the list) until the third intersection point is 
reached (point 6). Tuple accumulation resumes by following the tuple list of the black feature 
until the black feature reaches the cell boundary. At cell exit point 7, the boundary-closing cycle 
processing terminates. (As discussed previously, a boundary-closing cycle implicitly closes on 
itself by following the boundary edge in the appropriate direction until the starting point of the 
cycle is reached. However, boundary cell edges are not reported as part of the set operation 
product). Thus, the intersection product for Cell 5 is reported as the list of points 1 - 7. The bold 
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gray arrows in Fig. 16 trace the boundary closing cycle. The gray-shaded hatched region denotes 
the "implied" region intersection product. 

FIG. 18 illustrates the set union product generation for the center cell of Fig. 16. 
Referring to FIG. 15, column 4, row 6, the union product is seen to begin with the black 
entrance feature. Just as in set intersection generation, the union product is produced by 
accumulating tuples from the two feature tuple lists (to which have been added all appropriate 
pseudo points and embedded polyline intersection points), alternating between black and gray 
tuples following each polyline intersection point. The union product is reported as the ordered 
(region boundary) list containing the seven points shown in Figure 18. As is apparent, the only 
tuples shared by both the intersection and union products are the polyline intersection points. In 
this case, the union product is indicated by the black hatched region. 

FIGs. 19-22 provide further representative examples that demonstrate the application of 
FIG. 1 5 for Categories III - VL Intersection points are numbered in the direction of their use for 
intersection product generation. When there are complex interactions between two sets, it is 
often easier to picture NOT Union (regions outside the union product). Thus, in the following 
examples, regions not contained in (outside) the union product are outlined in bold black lines. 
Intersection cycles are outlined in gray. In other words, for set union, the entirety of each cell is 
inside the union product except for the union cycles outlined by bold black lines. 

Category I and II IntersectionAJnion Product Generation 
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Assuming "inside is to the right" for a Category I (boundary cell) x (boundary cell) 
interaction, set union begins with two boundary-closing cycles. For set intersection, on the other 
hand, there exist no boundary-closing cycles. The union boundary-closing cycles begin at both 
the gray and black feature entry points. For Category II, set intersection begins with two 
boundary-closing cycles. Set union contains no boundary-closing cycles. The intersection 
boundary-closing cycles begin with both the gray and black entry points. Just as in earlier 
examples, the intersection cycles shown in bold gray depict regions of the cell that are included 
in the product, whereas union cycles indicated in bold black depict regions of the cell that are 
not included in the union product. 

Examples of the two cell interaction Categories are shown in FIGs. 23 and 24. 

FIG. 23 offers an example of Category I interaction. As indicated in FIG. 1 5, for set 
union, boundary-closing cycles occur at both the black and the gray feature cell entrance points; 
the union cycles are shown by the bold black lines beginning at both Gr and Bp. Because "inside 
is to the right," these black triangular-shaped regions represent area excluded from the union, 
depicting areas within the cell that are not included in either region. The internal cycles shown 
outlined in bold black lines represent additional cycles ("holes" in the union). Note that 
alternating with the bold black cycles, are other cycles are outlined in bold gray lines. As in the 
earlier discussion, these cycles are associated with set intersection. As indicated in FIG. 15, set 
intersection cycles are generated by starting with either the black or gray feature at the first 
polyline intersection point (rather than the cell boundary pseudo point). Once the first 



Application of Antony, Richard Page -28- 

intersection cycle is complete, additional cycles are generated (as discussed previously) until all 
the polyline intersection points have been exhausted. 

FIG, 24 illustrates a Category 11 interaction problem. Once again, we assume that inside 
is to the right. As indicated in FIG. 15, there exist two intersection and no union 
boundary-closing cycles. The two intersection boundary-closing cycles are outlined in bold gray 
lines. The balance of the internal cycles shown outlined in bold gray lines represent additional 
cycles associated with the intersection product. Sandwiched between these intersection cycles are 
a series of those portions of the cell which are excluded from the union, outlined by the bold 
black lines. Identifying the portions of the cell excluded from the union cycle is accomplished by 
beginning at the first polyline intersection point associated with either the black or gray feature 
and continuing to accumulate cycles in the manner described previously until all the polyline 
intersection points have been exhausted. 

Despite subtle differences between the construction of Category I & 11 and Category III - 
VI set operation products, the mechanics of the process are obviously quite similar. Once the 
first cycle has been developed (be it a boundary-closing or internal cycle), the next cycle (for the 
same Boolean set operation product) begins at the next unused intersection point. The 
construction process terminates when no unused intersection points remain. Note also that union 
cycles occur between intersection cycles and conversely, intersection cycles occur between union 
cycles, regardless of the cell interaction Category. Because of this characteristic of the method of 
instant invention, rather than independently generating intersection and union products, the 
method permits efficient construction of both products during a single pass. 
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Regions with Included Holes 

Because the tuple ordering convention provides a consistent interpretation of a region's 
interior, the instant set operation generation procedure handles regions containing included holes 
without difficulty. FIGs. 25-28 provide examples that illustrate the straightforward nature of the 
analysis. As is evident in these figures, hole boundaries are handled identically to outer region 
boundaries in terms of set operation generation procedures. Note that in FIG. 27 and 28, the 
intersection product remains the same, regardless of whether the hole is fully enclosed or only 
partially enclosed by the quadtree indexing cell. 

Recursive Operation 

Once a given cycle is complete, a determination must be made whether the analysis of 
that cell is finished. Cell processing is complete when no additional unused polyline intersection 
points exist. If unused polyline intersection points remain, at least one additional internal cycle 
must exist. The next remaining cycle begins at the next unused intersection point along the 
entrance feature indicated in FIG. 15 (actually, either feature may be followed as long as cycles 
are developed in ascending tuple order) and continues until the cycle is complete. Note that the 
number of internal cycles that exist within the cell is highly sensitive to the characteristics of the 
data sets. Thus, for two or more internal intersection points, the number of internal cycles can 
not be reliably predicted. 



Final Product Development 
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The final set operation product of step 40 (FIG. 6) for two quadtree-represented regions 
is assembled as the concatenation of the products generated by the three stages of analysis. 
Class-] analysis contributes only region interior cells. For set intersection, both Class-2 and 
Class-3 generate only boundary cell products; for set union, Class-2 generates only interior cells, 
while Class-3 generates boundary cell products. Under special conditions, the latter can generate 
interior cells. If the desired set operation product is a vector boundary list (rather than a 
quadtree-indexed vector representation), the ordered tuple list or lists (in the case of disjoint of 
embedded set operation products) is generated by a simple quadtree-indexed vector boundary 
cell-to-vector boundary conversion which effectively strips off the indexing cell representation. 

In all of the examples presented in this document, the area "inside" a region has been 
assumed to be "to the right" of the ordered tuple boundary list. To maintain this convention, 
region boundaries use a clockwise tuple ordering. If, on the other hand, tuples are ordered in a 
counter-clockwise direction, "inside" will be "to the left." 

FIG. 29 is a table that summarizes general characteristics of each canonical form class 
and reflects the fact that very similar procedures are appropriate for constructing set operation 
products for both ordering conventions. Columns 1 and 2 are identical to those in FIG. 1 5. 
Column 3 identifies whether the total number of polyline intersection points within a cell is odd 
or even. Column 4 identifies whether the entry /exit pseudo points for a given feature are 
adjacent (e.g., GeGxBeBx) or whether they alternate (e.g., GeBeGxBx). Columns 6 and 7 indicate 
the total number of intersection and union cycles that occur within the cell for the number of 
polyline intersection points specified in column 5. 
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Referring back to the software flow of FIG. 6, after the final set operation product is 
calculated in step 40, the software method of the present invention proceeds through step 50 
where the set operation product is displayed and/or otherwise output for the user. At step 60 the 
software preferably releases the memory resources that it used for the foregoing computation. 
The user is given the option of completing another set operation product at step 70, and if such is 
desired the program returns to step 30. Otherwise, at step 80 the software releases the memory 
resources that it used in establishing the bounding box. The user is given the option of adding 
more features to the problem at step 90, and if such is desired the features are added and the 
program returns to the beginning (A) for initialization of a new bounding box. Otherwise, the 
program is done. 

The foregoing method capitalizes on the efficiencies gained by quadtree-indexed vector 
representation, and gives orders of magnitude improvement in efficiency over current 
computational methods. 



In the following explanation, the computational complexity of the proposed set operation 
generation approach will be compared to traditional computational geometry approaches to 
prove the efficiencies gained by the latter. When set operations are performed on two regions 
possessing significantly different sizes, the computational efficiency improvement over 
traditional computational geometry approaches is on the order of the number of boundary cells 
associated with the larger of the two regions. Because this number can be quite large, the 



Efficiency 
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potential reduction in computational requirements permits the implementation of a wide range of 
demanding applications that heretofore have been impractical. 

Traditional computational geometry approaches to polygon intersection between two 
simply-connected regions are known to possess computational complexity of order m x n (where 
m is the number of tuples in one region and n is the number of tuples in the other region). Thus, 
for even moderately large values of m and n, computational requirements tend to be quite high. 

FIG 30 is a graphical illustration of two regions exhibiting (a) limited boundary cell 
overlap and (b) two regions exhibiting extensive edge overlap. 

When only a relatively small percentage of the boundary cells of the two regions 
intersect (as in Fig. 30(a)), the computational requirements of the VGS approach tends to be 
rather modest. The worst-case conditions occur when the two regions are of approximately equal 
physical size and possess significant overlap (see Fig. 30(b)). In this case, most, if not all, the 
boundary cells of the two regions interact. Thus, unlike traditional set operation generation 
approaches that tend to be somewhat independent of either the detailed character of the regions 
or their degree of overlap, the computational complexity of the quadtree-indexed vector based 
approach is highly sensitive to both considerations. Even the worst case performance can be 
orders of magnitude better than traditional computational geometry approaches. The number of 
tuples associated with the quadtree boundary cells for two regions will be defined as mi and nj, 
respectively. Thus, m = mj and n = nj, where i = 1 , 2, . . . , I and j 1 , 2, . . . , J. The "worst 
case" computational complexity of the boundary cell set operation generation method is 
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therefore of order (rrii x = (m^ x n^) + (nxj x n^) + (mj x n^) + . . , By rewriting the 
computational complexity of the conventional computational geometry approach we find 

m X n = (m, + m2 + m3 +...) x (n, + n2 + n3 ^ (m, x n,) + x n2) + (m^ x n^).,. 

+ (mi X n2) + (mi x n3) + (m2 x ni) + (m2 x nO + . . . 

The computational requirements for the computational geometry approach is seen to 
involve a large number of cross-product terms not present in the proposed approach. As a 
consequence, the computational requirements associated with the new approach tend to be 
considerably less than that of traditional formulations. In addition, because the summation (mi x 
nj) need only be performed over those cells whose boundary cells are shared between the two 
regions, as stated above, the actual computational complexity will, in general, be considerably 
less than the worst case analysis predicts. 

To simplify the derivation, we will assume that all boundary cells contain a uniform 
number of tuples (i.e., mj m/I and nj = n/J for region 1 and region 2, respectively). Based on 
the worst-case node (cell) count theorem, it can be assumed that the number of interior and 
boundary quadtree indexing cells of each region are approximately equal. Thus, region 1 will be 
represented by 1 interior and 1 boundary cells; likewise, region 2 will be represented by J interior 
cells and J boundary cells. In addition, with no loss of generality, we assume that m ^ n. 
Based on these assumptions and using the search-based approach for evaluating cell interactions 
(the region with the lowest cell count controls the number of required operations), the worst case 
computational complexity for the three classes of cell intersection are as follows: 
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Interior cell x interior cell 



(I) 



Interior cell x boundary cell 



(I) 



Boundary cell x boundary cell 



(I) x (m/I x n/J) = (m x n) / J 



The latter expression arises because, for each boundary cell in the physically smaller of 
the two regions, all possible line segment intersections between the piecewise continuous line 
segments associated with two features must be computed. 

If I «m n, it becomes readily apparent that (m x n) / J » I. Thus, quite clearly, the 
(boundary cell) x (boundary cell) component of the computation (sub problem 3) dominates the 
complexity of the present approach. Therefore, the computational complexity of the VGS 
approach is on the order of (m x n)/ J and the "worst case" relative efficiency (with respect to a 
computational geometry-based approach), Er, is 

Er = (m X n) / (m X n) /J = J = n / nj 

In other words, the relative efficiency of the present approach with respect to the 
conventional computational geometry approach is equal to the number of indexing cells in the 
larger of the two regions (i.e., J). Equivalently, the relative efficiency is proportional to the 
number of tuples in the larger of the two regions (i .e., n). As the "size" of the larger region 
increases, so does the relative efficiency. If the number of points in the larger region is 
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increased by two orders of magnitude, for instance, the relative efficiency will increase by 
approximately that same proportion. 

When little overlap exists between the two regions (such as in Fig. 30(a)), the number of 
boundary cells (and their associated tuples) is much less than 1. Consequently, overall 
performance will be better than the above performance bound predicts. 

The "worst-case" performance of the present approach occurs when all boundary cells of 
the two regions intersect. In this situation, I = J and nj can be replaced by n/L Under these 
conditions, the "worst-case" performance improvement reduces to 

Er = n/n/l = I 

As a simple example, suppose I =100. Thus, the "worst case" relative efficiency 
improvement over traditional computational geometry algorithms is on the order of 100 (i.e., I ^ 
100). If the indexing cell size was further refined so that both regions contained 1000 boundary 
cells (i.e., I 1000), the present approach could be as much as one thousand times more efficient 
than traditional computational geometry-based intersection algorithms. 

Performance tests conducted using the software implementation of the generation 
methodology corroborate these theoretically derived performance bounds. 

Having now fully set forth the preferred embodiments and certain modifications of the 
concept underlying the present invention, various other embodiments as well as certain 
variations and modifications of the embodiments herein shown and described will obviously 
occur to those skilled in the art upon becoming familiar with said underlying concept. It is to be 
understood, therefore, that the invention may be practiced otherwise than as specifically set forth 
herein. 



